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ABSTRACT 

We introduce the new 2-D hexagon technique to probe the topological structure of the universe. The 
2-D matter space has been filled with hexagon unit cells, by detecting whether the cell includes matter 
or not we define the filled cells and empty cells, corresponding to high density region and low density 
region respectively. The number of the filled cells and empty cells are kept same by controlling the 
size of the cells. By analyzing the six neighbors of each hexagon we can get and compare statistical 
topological properties of the high density region and low density region in the universe, in order 
to have a more understanding of the evolution of the universe. We apply this hexagon method on 
2-MASS data and discover the significant topological differences between the high density and low 
density regions, even both of them have significant topological shift from the binomial distribution, 
or the random distribution. 

Subject headings: Cosmology: Observations, Galaxies: Distances and Redshifts, Cosmology: Large- 
Scale Structure of Universe, Methods: Statistical 



1. INTRODUCTION 

Neyman and Scott (1952, 1953b), using statistical 
techniques based on counts-in-cells, showed that the 
galaxy distribution is statistically clustered rather than 
statistically uniform, and on the other, Abell actually 
identified from Palomar Sky Survey plates, 2712 op- 
erationally well-defined rich clusters of galaxies (Abell 
1958). To better describe the clustering property, the 
two-point correlation function (Peebles 1973) and contin- 
ual high order correlation functions (Croton et al 2004) 
are widely used. High density regions, such as filaments 
(Wu et al. 2012), superclusters (Connolly et al. 1996) 
and low density regions, such as voids (Danny et al. 
2012), are discovered and well discussed. At the same 
time, people begin to have interest for the comparison 
of high density and low density region. If the early uni- 
verse is grown from quantum fiuctuations, then positive 
and negative fluctuations will have the same probability 
and a sponge-like structure (high density regions have 
the same topology with low density regions) is doomed 
(Gott 1986). With the evolution dominated by gravita- 
tional instability, the sponge-like structure is most likely 
remained whatever the growth stays in the linear regime 
or is damped (Bond & Szalay 1983). 

Gott (1986) found that both two regions have similar 
properties from the measured density contours, such as 
sponge-like shape, connectedness. However, Gott (1986) 
mentioned a direct test more than measuring density con- 
tours is needed. Here we use the hexagon cell method to 
investigate the equivalent of high density and low density 
regions. Our hexagon cell method is inherited from the 
dodecahedron cells (Kiang 2003, Kiang et al 2004); it is 
actually a 2-D application of dodecahedron cells method. 

Different cells have been applied to analyze the topol- 
ogy of galaxy distribution, such as truncated octahedrons 
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(Coxeter 1937; Gott 1967; Gott 1986), Voronoi foams 
(Icke and Weygaert 1987), dodecahedron cells (Kiang 
2004); 2D topological structure of the observed uni- 
verse has been investigated by variety of methods, such 
as genus method (Davies & Coles 1993, Colley 1997, 
Hoyle & Vogeley 2001, Hoyle et al 2002), percolation 
and ?lamentarity (Pandey & Bharadwaj 2008). Colley 
(1997) found that the 2D genus of the Las Campanas 
redshift Survey is very close to a Gaussian random field, 
Hoyle claimed the genus of 2 degree Field Galaxy Red- 
shift Survey team (2DFGRS) and Sloan Digital Sky Sur- 
vey (SDSS) reveals a smaller number of isolated voids 
than clusters (Hoyle & Vogeley 2001, Hoyle et al 2002). 

The motivation of this paper is to investigate the topo- 
logical relation between low density regions (such as 
voids) and high density regions (such as clusters) to get 
more understanding of the early universe. Our article is 
organized as follows. In section 2 we present the Hexagon 
cells method. In section 3 we study the data from the 
2MASS. We summarize the results in section 4 and have 
conclusions in section 5. 

2. HEXAGON CELLS 

If we partition the 2-D space into cells with the shape 
of hexagon, we can get hexagonal lattice. Hexagonal 
lattice is one of the five 2D lattice types and is the only 
lattice which can fill the 2D space inseparably with more 
edges than square. In the natural world, honeycomb is 
an interesting example of hexagonal lattice. For bees, 
hexagon is the most efficient shape for the bees' living 
and storing of food. In the reality, the hexagon structure 
provides the maximum strength and that it is used in 
designing airplane wings and satellites walls. 

The reason of using hexagon cells in this paper is that 
we want to have the direction detection as many as pos- 
sible for our unit cells. Hexagon cells could detect the 
neighbors in six directions and provide us the rich topol- 
ogy information, and at the same time fill in the space 
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r(l,0) r(0,l) r(l,l) t{2.2) t(3.3).. 

Fig. 1. — r(mi,m2) examples. Green hexagons have the same 
property (empty or filled) as the blue hexagon we selected, whereas 
red hexagons have the opposite property. For example, t(3, 3) 
means red hexagons are separated into 3 groups and green hexagons 
are also separated into 3 groups. Here we only show one particular 
configuration of t(1, 1) and t(2, 2) , as there are more possible 
coniiigurations than these, examples. . 

Without overlap or missing region. 

2.1. filled and empty cells 

For an application of hexagon method on galaxy dis- 
tribution, similar with the definition of rhombic cell in 
Kiang (2003), if a hexagon cell contains no galaxies, then 
we call it an empty cell; otherwise, it is a filled cell. Au- 
tomatically empty cells represent the low density region 
and filled cells represent the high density region. By 
this definition, we could discover if the filled and empty 
regions have the same morphology and give us a quanti- 
tative description of the topology. 

2.2. The number of like neighbors ni 

We know the hexagons have 6 neighbors and 6 edges 
shared with neighbors. If two neighbor cells are both 
filled or both empty, then we define their common edges 
as inner edge; otherwise we call it as outer edge. We 
use n to define the number of inner edges. The physical 
meaning of n is to find the same kind of neighbors for each 
hexagon and investigate the aggregation pattern for filled 
and empty cells. In this way n is the approximation of the 
two point correlation function in a specific small scale. If 
filled cells are not distributed in space randomly, we can 
expect the distribution of n will be depart from the well- 
known binomial form, same thing for empty cells. This in 
turn will tell us the morphology similarity and difference 
of high density and low density regions. 

2.3. The Topological Type T{mi,m2) 

Based on the definition of ni , we could divide the 
six edges into inner edges and outer edges two groups 
and consider the connectedness of inner edges or outer 
edges. We use mi to represent the number of connected 
inner edges group and use m2 to represent the number of 
connected outer edges group, that is T(mi, 7712) (see Fig. 
1). So r(0, 1) means a single cell surrounded by 6 unlike 
neighbors (same with ni — 0) and r(l, 0) means a single 
cell surrounded by 6 like neighbors (same with ni — Q). 
t(1, 1) means likely that the cell is on the boundary of 
the clusters or voids, so its two kind of neighbors are con- 
nected separately. r(2, 2) represents a string-like struc- 
ture and the string penetrates the cell and results in two 
groups for each of kind neighbors. r(3, 3) is complicated 
and its special shape possibly implies a mesh structure. 

2.4. The X - typ^ Indices 

To better understand the ni and T(mi,TO2) distribu- 
tion difference between observed and random sample, 
similar with the method mentioned by Kiang (2003), we 
define statistic 

X = (0-i?)//R (1) 




Fig. 2. — 2MASS data and projection scheme, 
(http: / /egsc. usgs.gov/isb/pubs/MapProjections/projections. html ) 

as a measure of the degree to which the observed fre- 
quency of cells exceeds its random expectation for each 
different structure defined by ni and T(mi, 7712). O is for 
observed sample, R is for random sample. 

2.5. Zero offsets of the grid of cells 

To investigate the uncertainties of the results, we use 
same method which Kiang (2004) used in Rhombic Cell 
Analysis. Obviously the present application of hexagon 
cell analysis revealed that ni and T(mi,m2) results 
are highly sensitive to the exact placing of the grid of 
hexagon cells. Because when the galaxies are fixed, if 
the cells are moved it will get same galaxies to lie in dif- 
ferent cells, we call it as the zero offset. We start our 
calculations by placing the centre of our zeroth cell (0, 
0, 0) at galactic coordinates (0,0,0). What we found was 
that if we displace our entire grid of cells by an amount 
up to and including one unit of aO, in any combination of 
the three directions, then the resulting ni and c will gen- 
erally be different, but slightly. If we define the length of 
cell is 1, we simply consider 4 shifts which are (0, 0.5), 
(-0.5,0), (0.5,0), (0.5,0.5). The four shifts will be com- 
bined with the origin gridding to get average ni and c 
value with stand deviation. Generally the shift will re- 
sult in the unequal number of filled and empty cells, but 
it is very tiny (around 1/1,000), moreover, it makes more 
sense for us to keep the length of the cell rather than ad- 
justing it to get equal number of filled and empty cells, 
so we fix the length of cells when we shift. 

3. 2MASS DATA 

The 2MASS (XSC, Jarrett et al. 2000) data we use is 
provided by Cao et al (2006), detailed information can be 
found in their paper. Simply speaking, they select galax- 
ies inside an elliptical isophote with surface brightness of 
20 mag arcsec^ in Ks band. To have 2-D plane data, they 
subject the 2MASS data to an equal-area projection by 
the Lambert azimuthal algorithm: 

Xi = R\/2 — 2\sin{b)\cos{l) 

X2 = Ry^2 - 2\sinlb)\sin{l) ^ ' 

where i? is a relative scale factor, b is the Galactic lat- 
itude and I is the Galactic longitude. This hemisphere 
scheme projects the whole sky into two circular planes, 
a northern and southern sky. We select a square with 
123°. 88 X 123°. 88 in the central part of each circular 
plane. We have two fields of 123°. 88 x 123°. 88 in the 
northern and southern skies. The northern square has 
323,000 galaxies and the southern includes 342,000 galax- 
ies. 

4. RESULTS 

4.1. Binomial distribution results 

Before we analyze the data, we could get the bino- 
mial distribution for reference. The binomial distribu- 
tion gives the discrete probability distribution Pp{n \ N) 
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of obtaining exactly n successes out of N Bernoulli tri- 
als (where the result of each Bernoulli trial is true with 
probability p and false with probability g = 1 — p , as 
given by Eq.4 below:). 



.X 10 



Pp{n I N) = 



n\{N -ny. 



(3) 



Since for any neighbors of a cell, the probability of a 
filled or empty cell is the same (we have the same num- 
ber of filled and empty cells), so p = 0.5 and q — 0.5. 
For an ni distribution, TV = 6 (6 walls for hexagon), 
and n=(0,l,2,4,5,6). From Eq. (4) the probability func- 
tion of a binomial distribution is (1/64, 6/64, 15/64, 
20/64, 15/64, 6/64, 1/64) for n= (0, 1, 2, 3, 4, 5, 6). 
For r(mi,m2) , since this is related to specific arrange- 
ments, it is better to list all combinations and calculate 
the probability than using Eq.(4). Since there are 6 walls, 
each wall could be inner wall or outer wall, so the total 
combinations give N = 2^ — 6A arrangements. If we use 
"1" for inner wall, and "0" for outer wall, we could write 
number 63 with 6-digits binary format, such as 000000, 
000001.... Then the 64 binaries are exactly all possible 
arrangements of r(m 1,7712) . We list the results in Table 
1: 

TABLE 1 





count 


arrangements 


r(0,l) 


1 


000000 


r(l,0) 


1 


mill 


^(1,1) 


30 


000001 000010 000011 000100 000110 000111 
001110 001111 010000 011000 011100 011110 
100001 100011 100111 101111 110000 110001 
001000 001100 011111 100000 110011 110111 
111000 111001 111011 111100 111101 111110 


r(2,2) 


30 


000101 001001 010100 001011 001101 010001 
010100 010110 010111 011001 011010 011011 
100100 100101 100110 101000 101001 101011 
010010 010011 011101 100010 101100 101101 
101110 110010 110100 110101 110110 111010 


r{3,3) 


2 


010101 101010 



In conclusion, the binomial probability function for 
r(mi,TO2) is (1/64, 1/64, 30/64, 30/64, 2/64) for r(0, 1) 
, r(l,0) , r(l,l) , r(2,2) , r(3,3) .We use two Monte 
Carlo simulations to test this. First, a random sample 
including 323,000 points (same size and geometry with 
observed sample) is used; the size of hexagon is adjusted 
to get an equal number of filled and empty cells. Second, 
a random cells distribution (number of filled and empty 
cells are equal, total number is kept the same with the 
first simulation) is directly used. Both distributions give 
almost undistinguishable results as shown in Fig. 3 and 
Fig. 4. This in turn reflects the simulated cell distribu- 
tion and can represent the real point distribution very 
well. Different seeds of random distributions produce al- 
most no difference too. 

4.2. 2MASS results 

After adjusting the length of cells and excluding the 
outermost cells (to avoid boundary distortion), we use 
172,320 fllled and 172,320 empty hexagon ceUs to fiU 
up the 2-D northern sky of 2MASS, which has roughly 
290,000 galaxies included by used hexagons. 
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Fig. 3. — ni results of a Monte Carlo simulation. 
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Fig. 4. — T(r?ii,m2) results of a Monte Carlo simulation. Tian- 
gles reperesent the filled cells and circles represent the empty cells. 
The number of filled and empty cells are kept same by adjusting 
the size of cells. There are actually two kinds of triangle and cir- 
cle from two kinds of Monte Carlo simulations (random cells or 
random points), but they are overlapped each other and undistin- 
guishable). 
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Fig. 5. — ni results of observed sample. 
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The observed sample results of ni and r(r7ii,TO2) 
shown at Fig. 5 and Fig. 6. 

From Fig. 5 and Fig. 6 filled and empty hexagon cells 
appears to have very close ni and T(mi , 7712) distribution, 
but considering the extremely small standard deviations, 
they are actually quite different (the reader may inquire 
whether the Zeroshift technique could be used to get ap- 
propriate standard deviation, but using a set of random 
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Fig. 6. — T(mi,m,2) result of the observed sample. Tiangles 
reperesent the filled cells and circles represent the empty cells. The 
number of filled and empty cells are kept same by adjusting the 
size of cells. 354,560 hexagons are used. This is the average value 
of 10 observed results from the zero offsets of grid cells mentioned 
in Sec 2.5. cr are extremely small (because we used huge number of 
hexagons and galaxies so the standard deviation is around 50 300 
for all Jii and T(m\,m2) ) and cannot be displayed on the figure. 

samples with different seeds would give even smaller stan- 
dard deviations for our ni and r(mi,TO2) distribution). 
Table 2 shows the x results mentioned in Eq. (2) with 
corresponding standard deviations of x result from 10 
Zeroshift grids. Note here we use binomial distribution 
as reference instead of comparing filled and empty cells 
directly, this is because we try to use binomial distribu- 
tion as a "ruler" to give the difference between filled and 
empty cells more physical meaning. 



TABLE 2 

X DISTANCE TO BINOMIAL DISTRIBUTION (NORTHERN SKY) 





Filled 


Standard 


Empty 


Standard 




cells 


deviation 


cell 


deviation 




X filled 


\ filled 


Xempty 


Xempty 


m = 


18.6 


0.97 


14.7 


1.22 


ni = 1 


-1.0 


0.66 


-8.8 


0.34 


ni = 2 


-32.6 


0.68 


-36.2 


0.86 


ni = 3 


-37.7 


0.87 


-30.9 


0.27 


ni = 4 


1.1 


1.13 


12.7 


0.72 


ni = 5 


66.4 


1.44 


63.3 


1.11 


ni = 6 


117.1 


1.97 


76.1 


0.85 


r(0,l) 


18.6 


0.97 


14.7 


1.22 


r(l,0) 


117.1 


1.97 


76.1 


0.85 


r(l,l) 


23.2 


0.99 


20.4 


0.63 


r(2,2) 


-42.3 


0.50 


-33.7 


0.74 


r(3,3) 


-18.7 


1.00 


-16.3 


1.17 



The same method is applied on 2MASS southern sky 
map and gets similar results shown in Table 3. Note here 
we cannot combine 2MASS northern and southern re- 
sults together because there is a slightly structure differ- 
ence between 2MASS southern and northern sky maps. 
For example, t(1,0) is 105.8 in southern sky map but 
117.1 in northern sky map. This is around 5(t difference 
and cannot be neglected. 

Note in Table 2 we see that ni = and t(0, 1) are 
identical, and ni = 6 and r(l,0) are identical. This is 
consistent with what we describe in Sec 2.3. We see that 
the results of ni=l, 3, 4, 6 and r(0, 1), r(2, 2) distribu- 
tion have significant differences between filled and empty 



cells. We also note the follow results from Table 2. 



TABLE 3 

X DISTANCE TO BINOMIAL DISTRIBUTION (SOUTHERN SKY) 





Filled 


Standard 


Empty 


Standard 




cells 


deviation 


cell 


deviation 




\ filled 


\ filled 


Xempty 


Xempty 


ni = 


1 Q c; 
io.O 


1 ov 


loo 


U.oU 


ni = 1 


-1.9 


1.11 


-8.7 


1.18 


ni = 2 


-31.4 


0.96 


-34.0 


1.808 


m = 3 


-35.5 


0.57 


-29.5 


0.55 


ni = 4 


1.7 


1.13 


12.1 


1.22 


ni = 5 


62.3 


1.42 


62.5 


1.62 


ni = 6 


105.8 


1.93 


74.0 


1.40 


r(0,l) 


18.5 


1.27 


12.2 


0.80 


r(l,0) 


105.8 


1.93 


74.0 


1.40 


^(1,1) 


22.7 


1.05 


21.3 


0.92 


r(2,2) 


-41.0 


1.03 


-32.8 


0.79 


r(3,3) 


-18.2 


0.68 


-15.1 


0.5 



• From ni=6 (or r(l,0) ) filled cells have a much 
stronger (21cr for northern sky and 15a for southern sky) 
aggregation property than empty cells. This is consistent 
with Hoyle (2002) who found that the number of isolated 
voids is smaller than the number of clusters in the 2dF 
Survey. 

• The most general case is n = 3 (20/64 probability for 
standard distribution); however, we can see the ni value 
at n = 3 is smaller than standard distribution. This can 
be explained as that aggregation makes filled and empty 
cells have more probability on ni > 3 than standard 
distribution, hence reduce the probability at n = 3. 

• For ni = l, filled cells are closer (12(7 for northern sky 
and 6ct for southern sky) to standard distribution than 
empty cells, which implies that empty cells have less or 
longer fiber structure than filled cells as the end of fiber 
always has ni=l. ni = l must result in a r(l, 1) structure, 
we do not see much difference between filled and empty 
cells for t(1, 1) in Table 1. This suggests ni — 1 structure 
is the minority in most r(l, 1) structures, in other words, 
r(l, 1) includes much more cells on the boundary of filled 
and empty cells than of the end. 

• r(2, 2) has also significant difference (12cr for north- 
ern sky and 8a for southern sky) between filled and 
empty cells. Empty cells have much more frequently 
penetrated by filled cells than filled cells penetrated by 
empty cells, which implies that empty cells have less fiber 
structure than filled cells and is consistent with ni=l re- 
sults. 

5. CONCLUSIONS 

With a direct measurement by the hexagon cell 
method, we found there are significant differences in 
topology between high density regions and low density re- 
gions in 2MASS data. The accuracy is assured by the Ze- 
roshift technique. We found low density regions have less 
fiber structure and are less clustered than high density 
regions to high accuracy. The difference could be caused 
by the evolution, which suggests that evolution might 
include more things other than gravitational instability 
(Bond & Szalay 1983); or caused by the original asymme- 
try of quantum fiuctuation in the early universe (Turner 
1998). Overall speaking, low density regions have slightly 
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closer topology to random distribution than high density 
regions. We also found the topology of 2MASS south- 
ern sky map is slightly closer to random distribution tan 



northern sky. Our work provides a supplement for genus 
analysis, which shows that high density and low density 
regions have almost identical properties (Park et al, 2001; 
Gott 1986) or are sUghtly different (Hoyle et al 2002). 
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